Tsallis entropy composition and the Heisenberg group 



NIKOS KALOGEROPOULOS § 

Weill Cornell Medical College in Qatar 
Education City, P.O. Box 24144 
Doha, Qatar 



Abstract 

We present an embedding of the Tsallis entropy into the 3-dimensional Heisenberg group, in 
order to understand the meaning of generalized independence as encoded in the Tsallis entropy 
composition property. We infer that the Tsallis entropy composition induces fractal properties 
on the underlying Euclidean space. Using a theorem of Milnor/Wolf/Tits/Gromov, we justify 
why the underlying configuration/phase space of systems described by the Tsallis entropy has 
polynomial growth for both discrete and Riemannian cases. We provide a geometric framework 
that elucidates Abe's formula for the Tsallis entropy, in terms the Pansu derivative of a map 
between sub-Riemannian spaces. 



PACS: 02.10.Hh, 05.45. Df, 64.60.al 

Keywords: Tsallis entropy, Nonextensive entropy, Heisenberg group, Abe's formula, Volume growth. 



§ E-mail: nik2011@qatar-med.cornell.edu 



1. Introduction 



The Harvda-Charvat [1], Daroczy [2], Tsallis [3], [4] entropy is a relatively recently in- 
troduced entropic form in Statistical Mechanics, which has attracted considerable attention 
during the last quarter of a century ([4] and references therein). The considerable interest in 
the (henceforth simply called) Tsallis entropy can be partly attributed to the recent investi- 
gations on the foundations of Statistical Mechanics, in particular to the investigations on the 
equivalence (or lack of) of the classical equilibrium ensembles [5], [6]. One reason motivating 
such investigations lies with systems with long-range interactions [7], where in particular the 
concepts of probabilistic independence, and subsequently of the additivity and extensivity of 
entropy have to be reconsidered [8], [4]. 

The potential significance of the Tsallis entropy for High Energy Physics and Gravitation 
could turn out to be almost as high as for Statistical Mechanics. After all, quantum field theories 
[9] are quantum theories, so they do have a statistical interpretation. Moreover and they can 
be seen as arising from self-averaging of appropriate random walks, at least if one is concerned 
about the behavior of theories around their Gaussian fixed point [9], [10]. It is evident that 
the covariant /path-integral quantization of field theories, even at zero temperature, essentially 
relies on the BGS entropy and its concomitant concepts [9], [10]. The derivation/formulation of 
the partition function, initially at the level of discretized variables, assumes weak correlations 
between adjacent spatial and temporal configurations [9]. This may be sufficiently adequate for 
weakly coupled theories with short-range interactions, but its use presents a huge challenge in 
strongly coupled theories or theories with long-range interactions as has been noticed before, 
at least since Gibbs [11]. 

In this vain one may try to appropriately treat gravity in the Tsallis entropy induced context, 
since gravity is a long-range interaction, from a Newtonian viewpoint. The perturbative formu- 
lation of General Relativity around the Minkowski vacuum has resulted in a non-renormalizable, 
not even Borel-summable, theory [12], [9]. This certainly calls for a different treatment in the 
covariant quantization of gravity: a potential attempt toward this goal may involve using a 
Tsallis entropy induced functional integral rather than the one that is currently employed. 
More radical proposals certainly exist (loop gravity [13], causal sets [14], causal dynamical 
triangulations [15], strings/branes [16], [17] etc) and each has its own successes, which could 
possibly be enhanced by looking at them under the prism of the Tsallis, rather than the BGS 
entropy. In a more restricted context, the Tsallis entropy should be considered as a potential 
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candidate in the searches for the statistical origins of the black hole entropy [18], [19]. This 
is a point of tremendous interest in gravitational physics during the last four decades, which 
despite our best efforts has so far eluded a universally acceptable explanation [19]. 

In our prior work [20], [21] we addressed the (arguably) biggest difference between the BGS 
and the Tsallis entropies: the way that independence and additivity is defined through each one 
of them. We concretely compared these concepts via a comparison of the composition proper- 
ties of the BGS and the Tsallis entropies. To that end, we put the usual BGS induced additivity 
and the generalized Tsallis-induced additivity side-by-side and established a Riemannian met- 
ric reflecting the differences between these composition properties [21]. Using this approach 
we have been able to explain, for systems described by the Tsallis entropy, why the largest 
positive Lyapunov exponent of the underlying dynamical system vanishes [22], derive geomet- 
ric interpretations of the nonextensive parameter [20], [21], justify the use of the escort [23], 
rather than the naively expected [24], probability distributions in applying of the maximum 
entropy principle [4], [24], and argue why the configuration/phase space of such systems grows 
at a power-law/polynomial rate as a function of the system's number of degrees of freedom [24]. 

In the present work, we take a different path in comparing the concept of independence as 
encoded in the BGS and in the Tsallis entropy composition properties. In Section 2, and to 
keep things as simple as possible and still have composition and inversion at our disposal, we 
use simplest familiar, to us, algebraic structure possessing these operations: linear groups. To 
that end, we embed the Tsallis entropy into the the set ilj^ 3 of 3 x 3 real, upper triangular 
matrices (with units in its principal diagonal). By examining the corresponding Lie algebra u, 
we see that ilj^ x3 is a matrix realization of the 3-dimensional Heisenberg group. We explicitly 
check that il^* 3 is indeed 2-step nilpotent and provide the proof that u is only possible 
other Lie algebra except the Abelian one which is nilpotent in 3 dimensions. Section 3 contains 
several loosely inter-related subsections which draw conclusions from the structure of Section 2 
with various degrees of relevance to the Tsallis entropy. More concretely, subsection A provides 
an alternative realization of the Heisenberg group via a semi-direct product construction. Sub- 
section B discusses and compares the Heisenberg with the Abelian actions from the viewpoint 
of integrability. Subsection C explores the effects of the contact distribution via non-trivial 
holonomies and isoperimetric comparisons. It also addresses the fractality of the Heisneberg 
group. Subsection D gives a definition of the sub- Riemannian/Carnot-Caratheodory distance 
function. Subsection E discusses the bi-Lipschitz equivalence and uniqueness of the distance 
functions. Subsection F discusses the dilations of the Heisenberg group. Subsection G elabo- 
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rates somewhat upon the role of dilations in the Heisenberg geometry. Subsection H discusses 
the Pansu differential in the restricted context of interest and provides a general framework 
for Abe's formula of the Tsallis entropy via the Jackson derivative. Subsection I establishes 
the power-law growth of the configuration/phase space for systems described by the Tsallis 
entropy, both for the cases of discrete and Riemannian phase spaces by using the fundamental 
theorem of Milnor/Tits/Wolf/Gromov on the polynomial growth rate of discrete groups. Sec- 
tion 4 concludes and presents some speculations for the implications of these constructions for 
Statistical Mechanics and Quantum Physics. 

The present work is a considerably expanded version of [25], providing motivations, details 
and explanations. 

2. Tsallis entropy and the Heisenberg group 

The Tsallis entropy S q [3] , [4] is a single-parameter family of entropies, labelled by a real 
number geR called nonextensive/entropic parameter. For a set of probabilities labelled by 
the discrete index set /, it is given by 



where k B is Boltzmann's constant. In case of a probability distribution on a continuous space 
with corresponding probability density p : Q — > R+, the analogue of (1) is 



where dfi is a Borel- regular measure on the underlying space Q. Technically, although it is 
sufficient for the definition of the Tsallis entropy for Q to be a space endowed with a measure 
H in most cases of physical interest Q is also endowed with a metric structure. Probably the 
most commonly encountered class of such spaces are Riemannian manifolds (M, g) for which 
dfi = cIvoIm is the Riemannian volume element uniquely associated to g. It is immediate 
that 




(1) 




(2) 



q-H 

where Sbgs stands for the BGS entropy, given in the continuous case by 



lim = Sbgs 



(3) 




(4) 
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Henceforth we will be setting ks — 1 for simplicity. 



Statistical independence is conventionally defined for two subsystems Mi,M 2 C M by 

Pm 1 *m 2 — Pm x ' Pm 2 (5) 

where Mi*M 2 indicates the combined system resulting from the interactions of Mi and M 2 . 
The Tsallis entropy (1) is not additive if we accept this definition for independent systems, but 
it instead obeys 

S q [pM 1 *M 2 ] = S q [p Ml ] + S q [p M2 ] + (1 - q) S q [p Ml ] S q [p M2 \ (6) 

To make the Tsallis entropy explicitly additive, one [26], [27] re-defines the concept of "indepen- 
dence" by introducing what is essentially a modified Abelian group structure whose addition, 
reflecting (4), is given by 

x ® q y = x + y + (1 - q)xy (7) 

where x, y e M. Our aim is to explore consequences of the generalized definition of indepen- 
dence as encoded in (7). To that end, we start by a slight change of variables 

S q [p] = (1 - q)S q [p] (8) 

Then (6) becomes 

S q [pM 1 *M 2 ] — SqiPMi] + S q [pM 2 ] + ^[PmJ S q [pM 2 ] (9) 

which amounts to inducing a modified version of the generalized addition by 

x® q y = x + y + xy (10) 

We introduce an injective map S : K. — >■ ilj^ x3 whose target space is the single-parameter subset 
& of the set of 3 x 3 upper-triangular matrices ilj^ x3 having real, equal off-diagonal elements. 
Concretely, 



S(x) = 1 x \ (11) 




and & = {B(x), x e R}. Consider the usual matrix multiplication 

1 x + y x + y + xy 
S(x) S(y) = j 1 x + y ' | (12) 

1 
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which using (10) can be re- written as 

(1 x + y x® q y 
1 x + y \ (13) 
1 

We see the map S is an embedding of M into il^ x3 which allows a comparison between the 
ordinary and the generalized addition (10) by looking at the non-trivial off-diagonal elements 
of (11). It may be worth noticing that & is not a subgroup of ilj^ x3 since it is not closed 
under multiplication. We will not address the question of the properties that & has as a 
subset of iljjj^ 3 in this work. Instead we continue observing that as the group multiplication 
is probably the simplest form of composition, it is a highly desirable feature for our purposes. 
In order to have available such a composition property, we will be working from now on inside 
the ambient space il 3 ^ 3 which is 

x y 

il 3x3 = { I 1 z I , x,y,z ei } (14) 




1 

(3x3 



Inside Ujj , the inverse matrix of §(x) is given by 

/ 1 -x 

(S(x))- 1 = 1 -x I (15) 




and the group-theoretical commutator 

[«(*),%)] = «(*) %) (S(x))- 1 (S( l/ ))- 1 (16) 



is 





[§(*),%)] =01 (17) 



Since xy ^ 0, & is not Abelian. Actually, it would be quite surprising if it were, as this would 
imply that the generalized addition (10) would essentially be the same as the ordinary addition. 
In turn, that would imply that the Tsallis entropy composition property (6) is essentially the 
same as the ordinary addition. Subsequently, the sets of axioms [28] - [30] would imply that 
the Tsallis entropy is just the BGS entropy, something which is clearly false. In this fomalism, 
the origin of the difference between the ordinary and the generalized addition (10) is exactly 
that xy in (17). Now, for z G K consider the commutator 

[§(z),[§(x),§(y)}} = l 3x3 (18) 
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where the right-hand side indicates the identity element of ilj^ x3 . This expresses that the sub- 
group of il^ x3 inside which & is embedded, is 2-step nilpotent. 



To determine the Lie algebra u of i! 3 ^ 3 
l 3x3 , which gives 



we find the differential at the identity element 



u = 





a 




((: 





: ) 







o / 



a,b,c G 



(19) 



We immediately verify that u has as a basis 



1 
A = 10 




Y 




Z = 




(20) 



The Lie algebra u has Lie bracket [A, B] = AB — BA, V A, B e u. The only non-trivial 
commutator is 

[X,Y] = Z (21) 

with all other commutation relations being zero, as is readily seen. This is immediately recog- 
nized as the Heisenberg commutation relation familiar from quantum Physics, where Z is an 
element in the center of u. Then the group ilj^ 3 in (14) is the Heisenberg group with Lie 
algebra u, and the elements of the former can be obtained by the elements of the latter via 
the exponential map. It is worth noticing that the ambient space of i4 x3 is R 3 and that the 
corresponding Killing-Cartan metric is zero, as it is for any nilpotent group. 



First, we make a comparison with the induced structure of the BGS entropy in this language. 
For the BGS case, we only need the ordinary addition to express its associated concept of 
"independence". Hence, the map analogous to (11), would be 



(22) 



S E {x) = 

and the general matrix group of interest, analogous to (14) is 




3x3 




x,y,z E 



(23) 
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which is the group of translations of IR 3 . Its Lie algebra D has generators 





X E = , Y E = ( 1 | , Z £ = | | (24) 
\ / 

and is, obviously, commutative. 




Second, we try to understand the difference, in the present formalism, between the ordinary 
and the generalized additions reflecting the different views about independence and additivity 
that are encoded in the BGS and the Tsallis entropies. In our previous work such a comparison 
was encoded through metrics and was made concrete by the contrast between a Euclidean (for 
the BGS case) and a hyperbolic (for the Tsallis case) metric. The universality of the Tsallis 
entropy, expressed in the axioms of [28] - [30] was expressed in the metric formalism via the 
Hadamard-Cartan theorem. In the present work, we expressed above the BGS and the Tsallis 
entropy composition properties in a simple algebraic way by using the embedding (11). Then 
the difference between the BGS and the Tsallis entropies is reflected via the comparison of the 
Abelian with the Heisenberg Lie algebra (21). The universality of the Tsallis entropy, namely 
the counterpart of the conclusions of the Hadamard-Cartan theorem in this approach, is ex- 
pressed by the fact that u and d are the only 3-dimensional 2-step nilpotent algebras over IR, 
as is well-known [31]. So, it can be claimed on rough Lie-algebraic grounds, that the relation 
of the Tsallis to the BGS entropy is similar to that of the quantum to classical mechanics. The 
role of the Planck constant H is played in the entropic case by the nonextensive parameter 
q. Elements of this analogy can also be drawn from (10) which resembles the addition of 
probabilities in quantum Physics, with the interference term xy in (10) giving rise to the novel 
properties of the Quantum as compared to those Classical Physics. 

As a third point, we allude to the well-known result that u and D are the only nilpotent 
algebras in 3 dimensions. In terms of the respective Lie groups, the statement is that the 
Heisenberg group il^ x3 and the group of Euclidean translations £> 3x3 of IR 3 are the only 
two simply-connected nilpotent groups in 3 dimensions [31]. The significance of this result is 
that it states that the Tsallis entropy is not only the simplest, but also the only alternative to 
the BGS entropy, at this level of algebraic complexity of the underlying structures. As such, 
the statement of the present paragraph is universal, and its role in the present formalism is 
analogous to that of the Cartan-Hadamard theorem of the metric approach [21]. 
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3. Geometry, volume growth, derivatives 



A. We have repeatedly used in the previous section that il^ x3 has as underlying topological 
space M 3 and, as a result, we freely switch between the Lie theoretical and the Euclidean 
(geometric) viewpoints. We justify this as follows: Consider M 3 parametrized by (x, y, z) 
and endow it with the following inner product 



(x 1 ,y 1 ,z 1 ) ■ (x 2 ,y 2 ,z 2 ) = ^ci + x 2 , yi + y 2 , z\ + z 2 + ^(xiy 2 - x 2 yi)^ 



(25) 



This inner product defines the (sub-Riemannian) Carnot-Caratheodory metric tensor gcc- 
We observe that left-translations/left- multiplications preserve the horizontal distribution 3~C 
which is the subset of ilj^* 3 generated by X, Y in (20). This can be explicitly checked for each 
one of the X, Y, Z as follows: Consider a left-translation L( a ,&, c ), with (a, 6, c) G M 3 , acting 
on (x,y,z), namely 



L iaAc) (x, y, z) = (a, b, c) ■ (x,y, z) = (a + x, b + y, c + z+ ^(ay - bx)^j 



The corresponding differential is given by the Jacobian matrix 



b a i 
~2 2 



d{x,y,z) 

Consider X as a column matrix having coordinate components X = (1,0, — |) and 



(dL) (aM) X = 1 (28) 



giving 



Moreover 



giving 



1 











1 





b 
2 


a 
2 


1 


d 
dx 




b 


( 


~2 




(26) 



(dL) {aM = d J^^vA = I I ? o I (27) 



^^ X -d-x + {-2-l)Yz W 
d b + y d , , 

x °k*> = te—rT, (30) 



{dL)( a ,b,c)X = X O L(a,b,c) (31) 

which is what we wanted to establish for X. Working in a similar manner, we find 

(dL) ( a ,b,c)Y = ^ + = Y o L (aM) (32) 
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and 



(dL)( a ^ c )Z = — = Z o L{a,b,c) (33) 

Therefore, indeed each left translation (26) is an isometry of gcc- Consider now the map 
: K 3 ->■ il 3x3 given by 

(1x2 + |xy \ 
1 y \ (34) 
1 J 

This is a Lie group isomorphism between M 3 endowed with the inner product (26) and il^ 3 
endowed with the usual matrix multiplication, as can be readily checked. Under this isomor- 
phism, we see that the basis vectors of 1R 3 map precisely to the basis matrices of (20). It is 
worth mentioning that this inner product (26) is strongly reminiscent of the warped product in 
the solvable group construction that lead to the effective hyperbolic metric in [21]. That was 
subsequently used in [22], [24] for uncovering, with the addition of simplifying assumptions, 
several properties of systems described by the Tsallis entropy. This is not a coincidence as all 
nilpotent groups, such as the Heisenberg group, are obviously solvable, hence the above sub- 
Riemannian construction expresses the same properties, as the hyperbolic metric construction 
[21], of the Tsallis entropy composition, even if seen from a different viewpoint. 

B. It is worth noticing that X, Y alongside [X, Y] span the whole u. This is unusual, from 
a Euclidean viewpoint, where all three vectors X, Y, Z are needed to determine the Lie algebra 
of translations. The difference is that for the case of Euclidean translations, the corresponding 
Lie algebra (23) is commutative, hence all the commutators between the basis vectors vanish. 
For the Euclidean case, the basis vectors X E , Y E can be chosen to lie in the "horizontal" plane 
xy, by choosing as in (24) 

Xe — 7— i Y E = —, Z E = — (35) 
ox ay oz 

By contrast, due to the nature of the Heisenberg algebra, when X, Y are horizontal, namely 
orthogonal to the center of u the vector [X, Y] will lie in the z, the "vertical" , direction. 
As above, the horizontal distribution of planes generated by X, Y is denoted by "K. Choose 

d y d y d ^ x d ^ d 

dx 2 dz ' dy 2 dz ' dz 

whose commutators satisfy the Heisenberg algebra (21). Dually, and equivalently, one may use 
co-frames and express the Heisenberg condition (21) via the vanishing of the contact form 

S = dz — —(xdy — ydx) (37) 
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where d indicates the exterior derivative. Then the kernel of S 



ker 



(38) 



determines locally K, which is the base space of the submersion R 3 — > M. 2 . 

In the case of Euclidean translations, the vectors X&, Ye (35) span a plane which is the 
leaf of a foliation of M 3 by "horizontal" planes parallel to the span of Xe, Ye and which are 
parametrized by z. By contrast, in the Heisenberg case, the vectors X, Y (36) span, locally, a 
plane which does not form a foliation of M 3 . One way to express this contrast is via Frobenius 
theorem: in the Euclidean case, the plane distribution is integrable, whereas in the Heisenberg 
case, the contact distribution is (maximally) non-integrable. More generally, the commutators 
of a nilpotent Lie algebra result in enough, linearly independent, vectors that span their whole 
linear space. 

C. Consider a curve 7 : [0, 1] — > M 3 starting at the origin 7(0) = G I 3 whose co-framing 
belongs to ker S i.e. which its everywhere tangent to the horizontal distribution IK. Moreover 
assume that the endpoint of this curve lies on the z-axis 7(1) = (0, 0, z). Consider its projection 
on IK and call it 7. Then, the area A of the disk D that 7 encloses is given by 



where A indicates the exterior /wedge multiplication of differential forms. We observe that 
dA is the differential of an element of kerS. Combining (38) and Stokes' theorem we find 
that 



Notice that z is the Euclidean distance between 7(0) and 7(1). Therefore the non-integrable 
distribution (38) gives rise to non-trivial holonomies of curves. This formalism has been exten- 
sively used in describing adiabatic phases acquired by states of a quantum system (Berry phase, 
Hannay angles etc) [32], in the description of mechanical systems with constraints [33] and in 
classical field theory in models involving gauge fields coupled to scalars or fermions [9]. In the 
current context, this result can be interpreted as stating that the generalized addition (10) is 
akin to introducing a multiplicative non-integrable phase to the composition of the probabilities 
of the two interacting sub-systems M 1; M 2 of (6). This phase expresses in a different way, the 
generalized concept of independence, usually encoded by the long-range temporal and spatial 
correlations of the distributions pu x and pm 2 (6). The entanglement of subsystems expressed 




(39) 




(40) 
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through such a non-integrable phase is one of the conjectured features of systems described by 
the Tsallis entropy. 



Moreover, looking at the process leading to (40) one can also infer the following [34]: due 
to the isoperimetric inequality in the plane 

A < ± l 2 cc(l) (41) 

where lee stands for the length of its argument with respect to the sub-Riemannian metric 
gcc- Because 7 is the projection of 7 on "K, we have 

A < ± llci.1) (42) 

According to (40) the Euclidean distance d E between the endpoints of 7(0) and 7(1) is A, 
and given that lcc{l) — dec between the endpoints we see that 

\f4xd E < dec (43) 

On the other hand 



dec < h{j)^And E (44) 

where h(j) is some function depending on the endpoints of 7. The last equation expresses 
the fact that the metric is Euclidean on "K. We conclude that in the z-direction, perpendicular 
to "K or the complement ker S, the sub-Riemannian metric gee behaves like the square root 
of the Euclidean metric g E) whereas along "K it behaves like g E . Hence the metric gcc is 
highly anisotropic from a Euclidean perspective. 

To explore consequences of this, consider two small cubes, I E £ (£> 3x3 ,g£) and Ice £ 
(il 3 ^ 3 , gcc)- Due to the previous arguments the areas of their faces along J~C scale in both 
t 2 where t > is a scaling parameter. In the direction orthogonal to 3~C the length 
of the Euclidean cube I E scales as t, as expected. By contrast, in the same direction, the 
cube Ice has length that scales as t 2 as can be seen from (43), (44). So, we arrive at the 
scaling relations 

Vol(I E ) ~ t 3 (45) 

and 

Vol(Icc) ~ t 4 (46) 

Then, essentially by definition, the Hausdorff dimension of (S 3x3 ,g_B) is 3 and that of 
(il 3 l x3 ,gcc) is 4. We observe that the underlying topological space in both of these cases 
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is M 3 whose topological dimension is 3. Hence the space (ttf? 3 ,gcc) is a fractal [35], [36]. By 
that we mean exactly a space whose Hausdorff dimension is strictly greater than its topological 
dimension [35] . Note, in passing, that the opposite inequality is impossible [37] and that the 
definition of fractals [35] does not require their Hausdorff dimensions to be non-integers (see 
however [36]). 

We arrived in a fractal structure for (ilj^ x3 , gec) by restricting the admissible curves to be 
tangent to the non-integrable distribution "K. Then the distance between any two points, de- 
fined as the infimum of all curves joining them, is quite different for the sub-Riemannian metric 
gec than for the Euclidean metric g£. By this realization we come back to the origins of the 
Tsallis entropy: the introduction of the Tsallis entropy relied on the multi- fractal formalism [3], 
[4]. Naturally it should encode some of its properties. Probably the simplest such manifesta- 
tion is that the concept of "independence" adopted by the Tsallis entropy, via the generalized 
addition (10), gives rise to a fractal structure on the M 3 when compared to the Euclidean struc- 
ture induced by the BGS entropy. The above considerations pertaining to the dimension are 
very special cases of the ball-box theorem and of the general Bass/Mitchell/Gershkovich/Nigel- 
Stein-Waigner [33], [34] formula giving the homogeneous dimension of sub-Riemannian spaces 
at their regular points. 

D. As a result of the isomorphism ^ (34), one can equivalently define the sub-Riemannian 
metric gec on ^r x3 to be the Riemannian metric with respect to which the basis vectors 
(36) are orthonormal [33], [34]. It turns out that g cc is the restriction of the Euclidean metric 
gE on the horizontal distribution "K spanned by X,Y (36). In the Lie algebra language the 
argument is re-expressed as follows: by inverting (36) we get 

£- = X+*Z, ^ = Y- X -Z (47) 
dx 2 ' dy 2 y 1 



Let 



After using (48), we have 



d d d 

W = w iTT + w ^7T + W 37T ( 48 ) 
ox ay oz 



W = w 1 X + w 2 Y+(^ z -)^xw 2 -yw 1 ) S jZ (49) 

and assuming that Se(W, •) G kerS we get from (49) that the coefficient of Z vanishes, thus 
giving the sought after 

W = w ± X + w 2 Y (50) 
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The Carnot-Caratheodory distance between two elements A, B of iljj^ 3 is defined as the 
length lcc(A, B) of the shortest horizontal curve 7 : [0, 1] — >■ "K joining A,B, namely by 

d C c(A,B)=wi{l 1 (A,B)} (51) 

7 

One can verify that any two points on iij^ x3 can be connected by such a horizontal curve 
of finite length with respect to gcc as follows [34]: consider the one parameter subgroups 
Rx, Ry of right translations of Hr X3 generated by X,Y in (20). These two single parameter 
families are tangent to "K. Since X, Y are the generators under the commutator of u, the 
subgroups Rx, Ry they generate, also generate the whole of ilj^ 3 . Therefore any two 
points of iljj^ 3 can be connected by a piecewise smooth curve, whose every smooth piece is 
generated either by Rx or by Ry- This is clearly impossible in (M 3 ,g^). It does however 
happen in (iljjj* 3 , gcc) because "K allows for curves that "turn and twist" sufficiently to 
make it possible. The general statement substantiating this pictorial explanation, applicable 
to any sub-Riemannian space with vector fields having iterated Lie brackets spanning their 
whole tangent bundles, is due to Chow and Rashevskii [33], [34]. From the sub-Riemannian 
viewpoint, the Tsallis entropy composition (7), (10) essentially amounts to the existence of a 
horizontal distribution 3~C. 

E. It may also worth briefly commenting on the uniqueness of gcc- Looking in the above 
construction, we see that the Tsallis entropy composition property (7), (10) actually determines 
the map (11). Upon embedding the image of (11) into (14), we could use any metric we wished 
on (14) to determine the properties of interest, as the construction itself does not instruct us 
how to choose a "natural", from the physical viewpoint, metric. One can even attempt to work 
outside the realm of Riemannian spaces, with Finslerian metrics, for instance. Then it is not ob- 
vious that the results obtained, certainly the ones pertinent to the concept of "independence" of 
subsystems of a physical system, would not be dependent of such a construction. In short, apart 
from its familiarity and simplicity, why is the use of gcc, or equivalently, of the inner prod- 
uct (25) optimal for drawing conclusions about physical systems? To make things somewhat 
manageable, we constrain ourselves to sub-Finslerian metrics, in analogy with sub-Riemannian 
metrics, namely to distance functions d\ and d 2 calculated along horizontal distributions 
which are arising from (Banach) norms 1 1 • 1 1 . Consider two such sub-Finslerian metrics 1 1 • 1 1 1 
and ||-||2 giving rise to di and d 2 respectively. The physical properties of the systems that 
we have in mind do not change, as will also be seen in the sequel, if the distances di, d 2 do 
not change all that much, that is if such distances change, but in a uniformly controlled manner. 
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Using (35) and (38), this is expressed precisely by stating that the physical properties 
of interest associated with the concept of "independence" do not change if the two distance 
functions d±, d 2 are bi-Lipschitz equivalent, namely if there are constants C\ > 0, c 2 > 
such that 

— d 2 (x,y) < di(x,y) < c 2 d 2 (x,y), — d x (x, y) < d 2 (x, y) < c x d x {x, y), x,y Gl 3 (52) 
c 2 Ci 

However, it turns out that all sub-Finslerian spaces of interest in this subsection are bi-Lipschitz 
equivalent, so it makes no difference for the physical aspects of the system under study which 
particular metric we choose to work with. As such we choose the most flexible and familiar 
ones, namely the sub-Riemannian ones arising from inner products such as (25). 

F. Another point worth noticing is that u has a simple dilation property mirroring the 
corresponding symmetry of M 3 . The dilations 5 t , t e M+ of 2) 3x3 are a single parameter 
family of diffeomorphisms given by 



(53) 



and they are isometries of (D 3x3 ,g E )- In i^ x3 , due to the existence of the generalized 
addition in (12), such a simple behavior of the metric under dilations is not possible. One 
observes however that the more general, graded re-scalings 

2 z \ 

(54) 

result in the following simple-looking behavior for the sub-Riemannian distance function 

d cc (St(A), 5 t (B)) = t d cc (A, B), A, Be il 3x3 (55) 

This can be checked explicitly from (54). We have to pay attention to that d C c is calculated 
along curves that are tangent to "K which re-scale linearly under 5 t . Hence the corresponding 
distance function dec also re-scales linearly under the above dilations. The fact that the 
central element scales quadratically with t 2 , and not linearly, is not really important in this 
argument as its behavior is already taken care of via the bracket-generating condition (21). On 
the other hand, using the dilations (54), we can re-establish the previously reached result (46) 
that the small balls of gec have volumes scaling as £ 4 hence that the Hausdorff dimension 
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of il^* 3 is actually 4 instead of the naively expected, at a first glance, 3. 

G. Consider the Heisenberg group il^ x3 endowed with a left-invariant Riemannian metric 
g. Let 

St = \ 5* t (g) (56) 

where 5^ is the dilation of the metric induced from (54). The corresponding distance function 
d t converges to dec f° r t — > oo in the pointed Gromov-Hausdorff sense [38], [39]. Moreover 
(iij^ 3 , d t ) is isometric to (ilj^ 3 , \d). Hence the asymptotic cone [40], [41] of ilj^ 3 endowed 
with the Riemannian distance function d is ilj^ 3 endowed with the sub- Riemannian distance 
function dec- This is a geometric way of looking at the generalized addition (7), (10) induced 
by the Tsallis entropy. From the Euclidean/Riemannian perspective, which is expressed via the 
usual addition, the role of (7), (10) it two-fold: it gives a rise to a non-integrable distribution 
"K essentially due to the quadratic term of (7), (10). Naturally the space remains Riemannian 
for any value of the dilation parameter. In the large dilation limit though, the curves that are 
perpendicular to 'K are suppressed and distance calculations are only allowed along curves 
that are tangent to !K. This expresses geometrically that the difference between the usual 
and the generalized additions may be of little consequence for small values of numbers that 
are summed, but it becomes progressively important and eventually dominant as these num- 
bers increase. Eventually the inequality in the degrees of the terms of the generalized addition 
makes it more akin to multiplication than to addition. In other words, if someone thinks about 
addition of entropies for independent systems for the BGS case, this person should think in 
multiplicative terms when discussing the concept of "independence" for systems described by 
the Tsallis entropy. 

The dilations, being isometries, also reflect the self-similar behavior of (itjj^ 3 , gec) & t 
different scales. Given the origin of the Tsallis entropy in the multi-fractal formalism, this 
behavior should not come as a surprise. This self-similarity at different scales shows that the 
Heisenberg group behaves in a roughly similar way to the many other fractals encountered in 
Physics, almost all of which have non-integer Hausdorff dimension [35] , [36] . By considering the 
limits, the dilations allow us to explore the local sub-Riemannian geometry of the Heisenberg 
group by examining its asymptotic cone on which one would expect to be able to use several 
simplifications in the models under study. The fact that dilations provide isometries between 
the different cones has an additional implication: the Heisenberg group is purely 2-unrectifiable. 
Notice that it is not purely 1-unrectifiable as most of curves in it, not "escaping" to infinity, 
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have finite length. Being 2-unrectifiable [42] means that all rectifiable sets of Hausorff dimen- 
sion 2, namely all sets of finite area, have 2-dimensional Hausdorff measure zero with respect 
to the sub- Riemannian metric gcc- This expresses in a precise way that the overwhelming 
majority of 2-dimensional sets in the Heisenberg group ilj^ x3 are unrectifiable. Moreover the 
ones that are rectifiable, are insignificant in number (a rare exception), by comparison. This is 
in accordance with our intuition where fractals are so convoluted that (depending on the par- 
ticular fractal) lengths, areas etc are infinite when measures are considered at a very fine scale. 
Pictorially and for a comparison, it may worth seeing this observation as a multidimensional 
extension of the von Koch snowflake curve, any segment of which has infinite length hence any 
segment of which is purely 1-unrectifiable [36]. Again, this behavior induced by the composi- 
tion property of the Tsallis entropy, equivalently by the generalized definition of independence, 
can be traced back to the origins of the Tsallis entropy in the multii-fractal formalism [3], [4]. 
Thus once more we are brought back full circle to the set of ideas and structures behind the 
introduction of the Tsallis entropy. 

H. Given the unrectifiable nature of subsets of the Heisenberg group discussed the previous 
subsection, it may be of interest to examine to what extent structures induced by the Tsallis 
entropy are differentiable. This allows us to inquire to what extent first order calculus can 
be developed for such sets in order to probe their infinitesimal behavior. It also allows us to 
wonder about the degree of smoothness of these structures and far more importantly, especially 
for Quantum Gravity, to what extent such smoothness is a fundamental or an emergent phe- 
nomenon [14], [15], [42] - [44]. 

One can always wonder what is a local analogue of the ordinary derivative which somehow 
reflects the composition properties of the Tsallis entropy. Such a derivative has been constructed 
for Heisenberg (actually for stratified/ Carnot) groups in [45]. Pansu actually showed that a 
tangent map of a Lipschitz map between sub- Riemannian spaces exists, is unique and is a group 
homomorphism of the tangent cones which is equivariant with the corresponding dilations. This 
happens for almost all points of the Heisenberg group, except possibly for a set of Hausdoff 
measure zero. Hence there is a unique differentiable structure associated with a Lipschitz map 
which is best adapted to the dilation properties of the tangent cones to the Heisenberg group 
[45]. To be concrete, let us consider a map (f> : ilj^ x3 — > R™ from the Heisenberg group to the 
Euclidean space. Let the left translations on either group be indicated by abuse of notation 
by £ and let the corresponding dilations be indicated by (54) and (53) respectively. The 
Euclidean space W 1 is seen here as a 1-step nilpotent (Abelian) group. The corresponding 
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tangent map P<p given by Pansu's theorem [45] at A e il R x is 



(Pcf>)(A) 



lim 6 t -i o £, 1 , o o £ A o 5 t A 

t—>00 WA) 



(57) 



Let's be a bit more concrete for a couple of special cases of interst to the BGS and Tsallis 
entropies. First consider the map (p : D 3x3 — y £> 3x3 between the diagonal subgroups of R 3x3 
which represent Euclidean translations. This map is generally given by 
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Consider the matrix A e £) 3x3 in a neighborhood of which the Pansu derivative will be 
calculated and the corresponding variation matrix to be X e S? 3x3 . These have the form 




a; 
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tx 

tz 



cp(a + tx) 
(p(b + ty) 
ip(c + tz) 



with all entries being in HL The dilation (53) of X gives 
and the left translation of A by S t X has image under ip given by 
Since £> 3x3 is an Abelian group under addition, 
which gives 

([^(A)]-V(A + W) 

Upon taking the limit t — > we find 

(P<p)(A) 
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(59) 



(60) 
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So, we see that in the particular case of (64) its Pansu derivative is the Euclidean gradient, a 
fact compatible with our intuition about derivatives. 



As a second example, consider the map $ : il™ x — > £) 3x3 given by 
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(66) 



in analogy with (59). Then following the steps of the previous example, we have the dilations 
(54) which give 

$(a + tx) 
$(b + ty) 



$(A5 t X) = 
Moreover, since £) 3x3 is Abelian 

MAT 1 



(67) 











$(c + tz) 




(68) 



Let's confine ourselves to a neighborhood of the identity in Hr x3 . This is not a loss of generality 
as we can recover the behavior of any quantity in the neighborhood of any point of ilj^* 3 by 
acting by left translations on a neighborhood of the identity. Assume then that a = b = c = 
in (67). Then (68) gives $(a) = $(&) = $(c) = 0, which results in 
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The Jackson derivative is defined for a function h : 
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v ' t-*i tx — x 

when the limit exists. Given this definition, we see that the Pansu derivative of the map $ at 
the origin of ii^ x3 is given by the Jackson derivatives of its entries 
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It is an interesting observation by Abe [46] that the BGS entropy can be written as 




and that a similar relation holds for the Tsallis entropy where the ordinary has been substituted 
by the Jackson derivative 




The sub-Riemannian framework established above for the Tsallis entropy clarifies why (73) is 
the natural counterpart of (72). It also shows the important, but distinct, role of dilations in 
both cases. Moreover it provides a unified geometric framework for both the ordinary and the 
Jackson derivatives which are seen as special case of the Pansu derivatives of maps between 
sub-Riemannian spaces. 

I. Another conclusion of the Heisenberg group construction given above is the following: recall 
[38] - [40] that a Riemannian manifold M has polynomial (power-law) growth when the volume 
Vol : M — > M + of all balls B r (x), x G M of radius r does not increase faster than some 
power of r, namely, when there is an p G K such that 

Vol(B r (x)) < r p (74) 

Naturally, all Riemannian manifolds M are infinitesimally Euclidean, so for small r, we have 

lim Vol(B r (X)) < const r n (75) 

where n is the topological dimension of M. So, for such M, the above definition may become 
non-trivial when one considers the limit r — > oo. This amounts to taking the asymptotic 
viewpoint and wondering how the manifold looks at large scales (from large distances). This 
asymptotic viewpoint does not distinguish between M itself and a set of points in M having 
similar large-scale geometric characteristics. This geometric indistinguishability is encoded in 
the concept of quasi-isometry [38] - [41]. A map / : Mi — > M 2 between two metric spaces 
Mi, M 2 with corresponding distance functions d\ and d 2 is a quasi-isometry, if there are 
constants C\ > 0, c 2 > such that 

— d 1 (x,y)-c 2 < d 2 (x,y) < a di(x,y) + c 2 (76) 

Cl 

for all x, y G M\. In words, this means the following: we discretize M\ and M 2 by choosing 
nets with respect to d\ and d 2 respectively. By doing so, we ignore all small-scale structures 



19 



in Mi and M 2 . This is the role the constant c 2 in (76). The two nets are bi-Lipschitz 
equivalent. This makes precise that di(x,y) should not be distorted too much by /. The 
maximal distance distortion is determined by the constant c\. In this, large scale, treatment 
we are interested in quasi-isometrically invariant features of the underlying structures of Mi 
and M 2 . In the case of interest to us, we pick as a net in il^ x3 , the discrete Heisenberg group 
il 3 / 3 having integer entries 



(3x3 




x,y,z eZ } (77) 



This is a finitely generated group, and mirroring the case of real coefficients, it is also nilpotent. 
Choose a set of generators % of il| x3 by picking 



7i = 1 , T 2 = 1 1 (78) 





The generating set X should be made symmetric by including in it the inverses of its generators 
which are 

/ 1 -1 \ / 1 \ 

(71)- 1 = 1 , {T 2 y l =01-1 (79) 
\ 1 / \ 1 / 

Express any element U of il| x3 in terms of the generators of the symmetrized set % by the 
word and reduce 

U = (T 1 ) kl (T 2 ) h2 ---(T 2 ) ki (80) 

Such a representation of an element of il^ x3 in terms of elements of % is not unique. The 
length of the above word representing U is by definition |fci| + l&^l + • • • + \h\- The norm ||£7|| 
of U is the minimum length of all words that express U in terms of elements of 1 [39] . Let 
B r indicate the closed ball with respect to the generating set 1 having as center the identity 
element l 3x3 of it 3 / 3 and radius r, namely 

B r = {UeUr 3 : \\U\\<r} (81) 

The volume of B r is defined with respect to the counting measure, hence it is the cardinality 
of the underlying set. The group il 3 / 3 has polynomial growth if there are two constants c > 0, 
d > such that 

Vol B r < cr d (82) 
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One can prove that the property of polynomial growth does not depend on the choice of the 
generating set X. indeed, changing the generating set will only affect the value of c. The 
value of the exponent d depends only on the group il^ x3 and not on the choice of the generating 
set X. Hence any change of the generating set will amount to a quasi-isometry of il^ x3 but 
the polynomial growth property is invariant under quasi-isometries. A fundamental theorem of 
Milnor [47], Wolf [48], Tits [49], Gromov [39] (see also the more recent [50], [51]) states that a 
finitely generated group has polynomial growth if and only if it is virtually nilpotent, namely 
if contains a nilpotent subgroup of finite index. In our case of interest, the "virtual" adjective 
is not actually needed as il^ x3 is nilpotent itself. Not too surprisingly, the growth rates of the 
balls in il^ x3 and in il^ x3 are equivalent. Hence the growth rate of the balls in ilj^ x3 is also 
polynomial. Since these structures reflect the composition property of the Tsallis entropy. At 
this point we make the very strong assumption, that such a behavior is present at the level of 
phase space dynamics, and it is not emergent from the statistics. Then this polynomial/power- 
law growth of balls in the Heisenberg group ilj^ 3 actually reflects a similar behavior of the 
volume in the configuration/phase space of the underlying microscopic system whose statistical 
properties are encoded by the Tsallis entropy. Seen from a different viewpoint, this polynomial 
behavior is ascribed to strong spatial and temporal correlations that constrain the growth rate 
of the volume function in the configuration/phase space of the system. This is a result that was 
reached in [52], [53] for binary systems and was generalized in [24] for Riemannian manifolds. 
The present conclusion, reached through a different path when compared to the previous works, 
has the advantage that it is equally applicable to both cases of discrete and continuous systems 
at the same time. 

J. An important question that someone can ask is how different actually the Tsallis entropy 
is from the BGS entropy. From the above constructions it appears that this amounts to asking 
how different are the resulting sub-Riemannian from the corresponding Riemannian spaces. 
The answer is clearly a matter of quantifying "far" and "close". To do so in metric way we 
use bi-Lipschitz maps. For two spaces that are bi-Lipschitz, the distances are not distorted 
too much, so such spaces can be considered as equivalent if someone observes them from some 
distance. So the question is whether there a bi-Lipschitz map between the Heisenberg group 
iljj^ 3 and M 3 . By using Pansu's generalization of Rademacher's theorem to mappings between 
sub-Riemannian spaces, Semmes [54] answered the question in the negative. He proved [54] that 
there is no bi-Lipschitz embedding from an open subset of a Heisenberg group to a Euclidean 
space. In that sense the underlying geometry of the Tsallis entropy is quite different from that 
of the Euclidean space which is induced by the composition of the BGS entropy. Intuitively the 
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picture is clear: if such a bi-Lipschitz map existed then it would have to be almost everywhere 
differentiable. Its Pansu differential would have to be a group homomorphism. The approxi- 
mating maps would have to be also bi-Lipschitz due to the homogeneity of the blow-up cones. 
Therefore such maps would have to be injective which is impossible because the whole center 
of the Heisenberg group would be mapped to a point in the Euclidean space in order to have 
a group homomorphism. This result can be extended when the target is R n for any nGN 
and can be seen as the lack of an analogue of Nash's embedding theorem for sub-Riemannian 
spaces. As a by-product one sees that the Heisenberg group is purely 1-unrectifiable as was 
also noticed in subsection G above. 



4. Discussion and outlook 

We presented above a sub-Riemannian construction encoding the effects of the generalized 
addition induced by the Tsallis entropy composition property (7), (10). We showed how fea- 
tures of this construction can be used to justify known properties of the Tsallis entropy. We 
would like to point out that all the mathematical facts we used, except the particulars pertain- 
ing to the Tsallis entropy, are well-known even classical to practitioners of Geometry. What we 
have attempted in this work, was by barely scratching the surface of the existing knowledge of 
sub- Riemannian/Carnot- Carat heo dory geometry, to provide the connections with and to show 
how some of the the Tsallis entropy properties can be organized from a sub-Riemannian per- 
spective. It turned out that some previously known, but not obviously inter-connected, facts 
about the Tsallis entropy fit neatly in the sub-Riemannian framework, which provides an ef- 
fective geometric language that expresses in a unified way many aspects of the Tsallis entropy. 
Moreover, the Heisenberg group perspective highlights the universality and uniqueness features 
that distinguish the Tsallis entropy from other entropic functionals that have been developed 
over the last few decades [4]. 

One can ask whether there is any connection between the sub-Riemannian construction 
presented here and the the hyperbolic map r q used in [20], [21], [22], [24]. The answer to that 
is affirmative and is provided by the boundary construction of (Gromov)-hyperbolic spaces and 
the visual/Tits metrics on it, for non-compact symmetric spaces of rank 1 [38] - [40], [55]. We 
will elaborate upon these constructions, point out their connections with the Tsallis entropy 
and explore their potential physical implications in a future work. 
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